Abstract

We introduce PWP, a method for counting indirect influences, and compare it with a couple
of well-known methods. We provide combinatorial as well as probabilistic interpretations
for the PWP method.

1 Introduction 

The goal of this note is to compare three alternative approaches to the study of indirect influences.
Two of these approaches, namely the MICMAC and PageRank methods, are already well-tested
and each has a host of applications. We believe that the third method, PWP, while still in
its infancy, may also be useful in a variety of contexts. We consider a discrete scenario where
the variables involved are identified with the set $[n] = \{1, 2, \ldots, n\}$. We assume that direct
influences can be encoded into an $\mathbb{R}$-weighted directed graph without multiple edges with $[n]$ as
its set of vertices. Thus a direct influence of the variable $j$ on the variable $i$ corresponds to an
edge from vertex $j$ to vertex $i$. As usual we encode the graph into the matrix $D$ of direct influences
such that $D_{ij}$ is the weight of the edge going from $j$ to $i$, if there is one, and $D_{ij} = 0$
otherwise. Our problem consists in evaluating the indirect influences between vertices, i.e. finding
the matrix $T$ encoding indirect influences. In this work we make a comparative analysis of three
different definitions of $T$.



2 MICMAC 

The MICMAC method introduced by Godet [3] works as follows. Let $D$ be the matrix associated
with the graph of direct influences. The graph of indirect influences is represented by the
matrix $T = D^k$, where $k$ is a fixed small natural number, say 4 or 5. One defines the vector of
dependencies $d = (d_1, \ldots, d_n)$ and the vector of influences $f = (f_1, \ldots, f_n)$, where the
coefficients $d_j$ and $f_j$ are given by:

\[ d_j = \sum_{i=1}^{n} T_{ji}, \qquad f_j = \sum_{i=1}^{n} T_{ij}. \tag{1} \]

Thus $(d_j, f_j)$ is equal to the (in, out)-degree of the vertex $j$ in the graph of indirect influences.
The numbers $d_j$ and $f_j$ encode valuable information; for example, the most influential vertex $j$
is the one with the highest coefficient $f_j$.
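
The MICMAC recipe can be sketched in a few lines of plain Python. This is only an illustration: the function names and the 3-vertex graph (edges 1 to 2, 2 to 3 and 1 to 3, all of weight 1) are assumptions made for the example, and indices are 0-based.

```python
def mat_mul(A, B):
    """Multiply two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][l] * B[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def micmac(D, k):
    """Return (T, d, f) with T = D^k, d the row sums and f the column sums of T."""
    T = D
    for _ in range(k - 1):
        T = mat_mul(T, D)
    n = len(T)
    d = [sum(T[j][i] for i in range(n)) for j in range(n)]  # dependencies
    f = [sum(T[i][j] for i in range(n)) for j in range(n)]  # influences
    return T, d, f

# Hypothetical graph: D[i][j] is the weight of the edge from j to i.
D = [[0, 0, 0],
     [1, 0, 0],
     [1, 1, 0]]
T, d, f = micmac(D, 2)
print(T, d, f)
```

With $k = 2$ only the path from the first to the third vertex through the second one survives, so the first vertex is the only influential one and the third vertex the only dependent one.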
3 PageRank



The celebrated PageRank method [5] for studying indirect dependencies has already revolutionized
the world of internet search engines through the remarkable success of Google. The
mathematics encoding the basic structures behind PageRank is however surprisingly simple.
With hindsight we observe that PageRank differs from MICMAC in three main ideas: 1) Normalization
of influences, i.e. use of column stochastic matrices. 2) Use of complete graphs,
i.e. graphs whose associated matrices have no vanishing entries. 3) Taking infinite powers of
matrices.

Let $D$ be the $n \times n$ matrix of direct influences. It is assumed that the entries of $D$ are
non-negative real numbers, and that the sum of the entries of each column of $D$ is either 0 or
1. With PageRank dependencies and influences are determined by the pairs $(d_i, f_i)$ given by
(1), where now the matrix $T$ of indirect influences is computed as follows:

\[ T = \lim_{k \to \infty} \left[ p\bar{D} + (1 - p)E_n \right]^k \tag{2} \]

where: 1) $0 < p < 1$ is a chosen number close to 1, say $p = 0.86$. 2) $\bar{D}$ is obtained from $D$ by
replacing the entries of each zero column of $D$ by $\frac{1}{n}$. 3) $E_n$ is the matrix with all entries
equal to $\frac{1}{n}$.

For the web application $D$ is constructed as follows. Consider the web graph whose set of
vertices $[n]$ represents the set of all web pages in the world wide web. There is an edge from
page $j$ to page $i$ in the web graph if and only if a link to page $i$ appears in the content of
page $j$. The matrix $D$ of direct influences is such that $D_{ij} = \frac{1}{out(j)}$ if there is an edge
from $j$ to $i$, and $D_{ij} = 0$ otherwise; $out(j)$ is the out-degree of the vertex $j$ in the web graph.
Since $p\bar{D} + (1-p)E_n$ is a column stochastic matrix, the vector of influences for $T$ is
$(1, \ldots, 1)$; thus the relevant information is now in the vector of dependencies. The entries of
the vector $\frac{1}{n}(d_1, \ldots, d_n)$ yield the PageRank number of each web page. The greater the
PageRank number of a page the greater its importance. By construction $p\bar{D} + (1 - p)E_n$ is the
transition matrix of a primitive irreducible Markov chain [3]. Therefore the vector of indirect
dependencies $d = (d_1, \ldots, d_n)$ is an eigenvector of $T$ with eigenvalue 1, that is $Td^t = d^t$.
Moreover, the limit (2) can be computed in terms of $d$ as follows: $T = \frac{1}{n}(d^t \; d^t \; \cdots \; d^t)$.
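
The limit in (2) is usually approximated by power iteration. The following sketch assumes a hypothetical 3-page web (page 1 links to pages 2 and 3, page 2 links to page 3, page 3 has no outgoing links) and a fixed number of iterations; the function names are illustrative only.

```python
def google_matrix(D, p):
    """Return p*Dbar + (1-p)*E_n, where zero columns of D are replaced by 1/n."""
    n = len(D)
    G = [[0.0] * n for _ in range(n)]
    for j in range(n):
        column_sum = sum(D[i][j] for i in range(n))
        for i in range(n):
            d_bar = D[i][j] if column_sum != 0 else 1.0 / n
            G[i][j] = p * d_bar + (1 - p) / n
    return G

def pagerank(D, p=0.86, iterations=200):
    """Apply the column stochastic matrix repeatedly to the uniform vector."""
    G = google_matrix(D, p)
    n = len(G)
    v = [1.0 / n] * n
    for _ in range(iterations):
        v = [sum(G[i][j] * v[j] for j in range(n)) for i in range(n)]
    return v

# Hypothetical web graph: D[i][j] = 1/out(j) if page j links to page i.
D = [[0.0, 0.0, 0.0],
     [0.5, 0.0, 0.0],
     [0.5, 1.0, 0.0]]
ranks = pagerank(D)
print(ranks)
```

Every column of the limit matrix $T$ coincides with the normalized vector returned by `pagerank`, which is why a single vector iteration suffices and the full matrix power never needs to be formed.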



4 PWP 

We introduce the PWP method for counting indirect influences. PWP can be applied to any
matrix of direct influences, and while MICMAC focuses on paths of length $k$, and PageRank
focuses on infinitely long paths, PWP takes into account paths of various lengths.

In a nutshell the PWP method can be described as follows: given the matrix of direct influences
$D$, fix a real number $\lambda > 0$ and let $T$, the matrix of indirect influences, be given by

\[ T = T(D) = \frac{e_+^{\lambda D}}{e_+^{\lambda}}, \qquad \text{where} \qquad e_+^{x} = e^{x} - 1 = \sum_{k=1}^{\infty} \frac{x^k}{k!}. \]



Notice that $D^k = \frac{\partial^k}{\partial \lambda^k}\left( e_+^{\lambda} T \right)\big|_{\lambda = 0}$ for $k \geq 1$ and thus, in principle, we can compute
the MICMAC matrix of indirect influences from the PWP matrix of indirect influences. As an
example, let $L_2$ be the graph with vertex set $[2]$ and one edge from vertex 1 to vertex 2. The
only non-vanishing entry of $T$ is $T_{2,1} = \frac{\lambda}{e^{\lambda} - 1}$, making vertex 1 the most influential vertex.
Also we get that vertex 2 does not exert any influence whatsoever over vertex 1. The indirect
influence of vertex 1 over vertex 2 is given by

\[ \frac{\lambda}{e^{\lambda} - 1} = \sum_{k=0}^{\infty} B_k \frac{\lambda^k}{k!}, \]

where the coefficients $B_k$ are the so-called Bernoulli numbers. See [2] for an explicit definition
and a combinatorial interpretation for the Bernoulli numbers. Notice that $\frac{\lambda}{e^{\lambda} - 1}$ is less than
1, it approaches 1 as $\lambda$ approaches 0, and it approaches 0 as $\lambda$ goes to infinity. Therefore the
indirect influence that 1 exerts over 2 is smaller than the original direct influence, it approaches
its original value when $\lambda$ approaches 0, and it is negligible if $\lambda$ is a large number. The direct
influence that 1 exerts over 2 becomes less relevant as $\lambda$ increases, since with PWP only paths
of length close to $\lambda$ are really taken into account.
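
A minimal numerical sketch of the PWP matrix, assuming that truncating the exponential series after a fixed number of terms is accurate enough for the small matrices considered here. The helper names and the cutoff `terms=60` are choices made for this example, not part of the method.

```python
import math

def mat_mul(A, B):
    """Multiply two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][l] * B[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def pwp(D, lam, terms=60):
    """Approximate T = (e^{lam D} - I) / (e^lam - 1) by a truncated series."""
    n = len(D)
    power = [[float(i == j) for j in range(n)] for i in range(n)]  # D^0
    total = [[0.0] * n for _ in range(n)]
    coeff = 1.0
    for k in range(1, terms + 1):
        power = mat_mul(power, D)   # D^k
        coeff *= lam / k            # lam^k / k!
        for i in range(n):
            for j in range(n):
                total[i][j] += coeff * power[i][j]
    scale = math.expm1(lam)         # e^lam - 1
    return [[x / scale for x in row] for row in total]

# The graph L_2: a single edge from vertex 1 to vertex 2 (0-based: D[1][0]).
lam = 1.0
T = pwp([[0.0, 0.0], [1.0, 0.0]], lam)
print(T[1][0], lam / math.expm1(lam))
```

The two printed numbers agree, which matches the closed form $\lambda/(e^{\lambda} - 1)$ for the influence of vertex 1 over vertex 2.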

Our first result states some of the basic properties of the map $D \mapsto T(D)$ and provides a
probabilistic interpretation for the PWP method.

Theorem 1. 1. $T(D^t) = T(D)^t$, and if $D = QCQ^{-1}$ then $T(D) = QT(C)Q^{-1}$.

2. Let $D_1 \oplus D_2 \in M_m(\mathbb{R}) \oplus M_n(\mathbb{R}) \subset M_{m+n}(\mathbb{R})$, then $T(D_1 \oplus D_2) = T(D_1) \oplus T(D_2)$.

3. If $D$ is a column stochastic matrix, then $T(D)$ is a column stochastic matrix.

4. If $[D_1, D_2] = 0$, then we have that

\[ T(D_1 + D_2) = e_+^{\lambda} T(D_1)T(D_2) + T(D_1) + T(D_2). \]

5. If $D \in M_n(\mathbb{R})$ then $T(D)$ is the expected matrix of $\hat{D}$, where $\hat{D} : \mathbb{N}_+ \to M_n(\mathbb{R})$ is the
random matrix given by $\hat{D}(k) = D^k$, and $\mathbb{N}_+$ is the probability space with probability function
given by $p(k) = \frac{\lambda^k}{e_+^{\lambda} k!}$.
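
Properties 3 and 4 can be checked numerically on a small case. The sketch below is a sanity check rather than a proof; the matrix `D1`, the choice `D2 = D1 * D1` (which commutes with `D1`) and the series cutoff are assumptions made for the example.

```python
import math

def mat_mul(A, B):
    """Multiply two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][l] * B[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def pwp(D, lam, terms=60):
    """Approximate T = (e^{lam D} - I) / (e^lam - 1) by a truncated series."""
    n = len(D)
    power = [[float(i == j) for j in range(n)] for i in range(n)]
    total = [[0.0] * n for _ in range(n)]
    coeff = 1.0
    for k in range(1, terms + 1):
        power = mat_mul(power, D)
        coeff *= lam / k
        for i in range(n):
            for j in range(n):
                total[i][j] += coeff * power[i][j]
    scale = math.expm1(lam)
    return [[x / scale for x in row] for row in total]

lam = 1.0
D1 = [[0.5, 0.2],
      [0.5, 0.8]]            # column stochastic
D2 = mat_mul(D1, D1)         # commutes with D1
T1, T2 = pwp(D1, lam), pwp(D2, lam)

# Property 3: the columns of T(D1) sum to 1.
column_sums = [sum(T1[i][j] for i in range(2)) for j in range(2)]

# Property 4: T(D1 + D2) = (e^lam - 1) T(D1) T(D2) + T(D1) + T(D2).
lhs = pwp([[D1[i][j] + D2[i][j] for j in range(2)] for i in range(2)], lam)
prod = mat_mul(T1, T2)
rhs = [[math.expm1(lam) * prod[i][j] + T1[i][j] + T2[i][j] for j in range(2)]
       for i in range(2)]
print(column_sums, lhs, rhs)
```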

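Let $P$ be the Poisson probability on $\mathbb{N}$ given by $P(k) = e^{-\lambda}\frac{\lambda^k}{k!}$.

Proposition 2. 1. For $k \geq 1$ we have that $p(k) = \frac{e^{\lambda}}{e_+^{\lambda}} P(k)$. 2. If $X$ is a random variable with
distribution $p$, then $EX = \frac{\lambda e^{\lambda}}{e_+^{\lambda}}$, $EX^2 = \frac{\lambda(\lambda + 1) e^{\lambda}}{e_+^{\lambda}}$, $VX = \frac{\lambda e^{2\lambda} - (\lambda^2 + \lambda) e^{\lambda}}{(e_+^{\lambda})^2}$.

Therefore, by Chebyshev's inequality, we have for $c > 0$ that

\[ \mathrm{Prob}\left( \left| X - \frac{\lambda e^{\lambda}}{e^{\lambda} - 1} \right| \geq c \right) \leq \frac{\lambda e^{2\lambda} - (\lambda^2 + \lambda) e^{\lambda}}{c^2 (e^{\lambda} - 1)^2}. \]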
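
The moments of the distribution $p$ (a Poisson distribution conditioned on $k \geq 1$) can be checked numerically. The sketch below sums the series up to a cutoff `kmax`; the cutoff and the sample value $\lambda = 2.5$ are assumptions made for the example.

```python
import math

def truncated_poisson_pmf(lam, kmax=100):
    """Return {k: lam^k / ((e^lam - 1) k!)} for k = 1..kmax."""
    scale = math.expm1(lam)
    pmf = {}
    term = 1.0
    for k in range(1, kmax + 1):
        term *= lam / k          # lam^k / k!
        pmf[k] = term / scale
    return pmf

lam = 2.5
pmf = truncated_poisson_pmf(lam)
mean = sum(k * q for k, q in pmf.items())
second_moment = sum(k * k * q for k, q in pmf.items())
variance = second_moment - mean ** 2

e, e_plus = math.exp(lam), math.expm1(lam)
print(mean, lam * e / e_plus)
print(second_moment, lam * (lam + 1) * e / e_plus)
print(variance, (lam * e ** 2 - (lam ** 2 + lam) * e) / e_plus ** 2)
```

Each printed pair agrees up to rounding, so the mean sits slightly above $\lambda$ and the Chebyshev bound quantifies how strongly path lengths concentrate around it.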



In order to provide a combinatorial interpretation for the PWP method we need a few
definitions [1]. Let $\mathbb{R}$-Set be the category of $\mathbb{R}$-weighted sets, i.e. the category whose objects
are pairs $(x, \omega)$ where $x$ is a set and $\omega : x \to \mathbb{R}$ is a map. A morphism in $\mathbb{R}$-Set from $(x_1, \omega_1)$
to $(x_2, \omega_2)$ is a map $\alpha : x_1 \to x_2$ such that $\omega_1 = \omega_2 \circ \alpha$. We also consider $\mathbb{R}$-set, the full
subcategory of $\mathbb{R}$-Set whose objects are pairs $(x, \omega) \in \mathbb{R}$-Set where $x$ is a finite set. $\mathbb{R}$-set is a
distributive category provided with a valuation map $| \; | : \mathbb{R}\text{-set} \to \mathbb{R}$ given by

\[ |x, \omega| = \sum_{i \in x} \omega(i). \]



If $e$ is an edge of a directed graph we denote by $s(e)$ and $t(e)$ the starting point and the endpoint
of $e$, respectively. A path $\gamma$ of length $k$ from a vertex $j$ to a vertex $i$ in a graph is a sequence
of edges $\gamma = (\gamma_1, \ldots, \gamma_k)$ such that $s(\gamma_1) = j$, $t(\gamma_l) = s(\gamma_{l+1})$ for $1 \leq l < k$, and $t(\gamma_k) = i$. We
denote by $P(i,j)$ the set of all paths from $j$ to $i$, and by $P_k(i,j)$ the set of all paths of length
$k$ from $j$ to $i$.

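We assume that the graph of direct influences has associated matrix $D$. In our applications
we use the $\mathbb{R}$-weighted sets $(P_k(i,j), \omega)$, $(P_k(i,j), \rho)$ and $(P(i,j), \omega_\lambda)$ where $\omega$, $\rho$, and $\omega_\lambda$ are
given, respectively, on a path $\gamma = (\gamma_1, \ldots, \gamma_k)$ in the graph of direct influences by

\[ \omega(\gamma) = \prod_{i=1}^{k} D_{t(\gamma_i), s(\gamma_i)}, \qquad \rho(\gamma) = \prod_{i=1}^{k} \left( p\bar{D} + (1 - p)E_n \right)_{t(\gamma_i), s(\gamma_i)}, \qquad \omega_\lambda(\gamma) = \left( \prod_{i=1}^{k} D_{t(\gamma_i), s(\gamma_i)} \right) \frac{\lambda^k}{e_+^{\lambda} k!}. \]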



The following result provides a combinatorial interpretation for the MICMAC, PageRank, and
PWP methods.

Theorem 3. 1. If $T$ is the MICMAC matrix of indirect influences, then $T_{ij} = |P_k(i,j), \omega|$.

2. If $T$ is the PageRank matrix of indirect influences, then $T_{ij} = \lim_{k \to \infty} |P_k(i,j), \rho|$.

3. If $T$ is the PWP matrix of indirect influences, then $T_{ij} = |P(i,j), \omega_\lambda| = \sum_{k=1}^{\infty} |P_k(i,j), \omega| \frac{\lambda^k}{e_+^{\lambda} k!}$.
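
The path-counting statement for MICMAC can be illustrated by brute force: enumerate all vertex sequences of a given length, multiply the edge weights along each one, and compare the sum with the corresponding entry of $D^k$. The weighted graph below is hypothetical, and sequences that use a missing edge simply contribute weight 0.

```python
from itertools import product

def path_weight_sum(D, i, j, k):
    """Sum over length-k paths from j to i of the product of the edge weights."""
    n = len(D)
    total = 0
    for middle in product(range(n), repeat=k - 1):
        vertices = (j,) + middle + (i,)
        weight = 1
        for a, b in zip(vertices, vertices[1:]):
            weight *= D[b][a]        # the edge from a to b has weight D[b][a]
        total += weight
    return total

def mat_mul(A, B):
    """Multiply two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][l] * B[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

# Hypothetical weighted graph on three vertices (0-based indices).
D = [[0, 2, 0],
     [1, 0, 0],
     [1, 3, 1]]
k = 3
by_paths = [[path_weight_sum(D, i, j, k) for j in range(3)] for i in range(3)]
by_powers = mat_mul(mat_mul(D, D), D)
print(by_paths == by_powers)
```

Weighting each length $k$ by $\lambda^k/(e_+^{\lambda} k!)$ and summing over $k \geq 1$ gives the PWP entry in the same way.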



5 Examples 

Example 4. Let $L_n$ be the graph with vertex set $[n]$ and with an edge from $i$ to $i + 1$ for $i < n$.
According to MICMAC, for $k < n$, vertex $i$ will only influence vertex $i + k$. Thus the vertices
$\{1, \ldots, n - k\}$ are the most influential ones, each with influence 1. For $k \geq n$ the matrix of indirect
influences vanishes. MICMAC for $n = 3$, $k = 4$ predicts vanishing influences and dependencies.
PageRank for $n = 3$ gives the dependency vector $(0.17, 0.34, 0.47)$, making vertex 3 the most
dependent one. PWP for $n = 3$ and $\lambda = 1$ gives, up to the common normalization factor $\frac{1}{e - 1}$,
the vectors $f = (1.5, 1, 0)$ and $d = (0, 1, 1.5)$. Vertex 3 is the most dependent and vertex 1 has
vanishing dependency. Similarly, MICMAC for $n = 4$, $k = 4$ predicts vanishing influences, PageRank
dependencies are $(0.12, 0.21, 0.30, 0.37)$, and PWP with $\lambda = 1$ yields, up to the same factor, the
vectors $f = (\frac{5}{3}, \frac{3}{2}, 1, 0)$ and $d = (0, 1, \frac{3}{2}, \frac{5}{3})$. PageRank lowers the
dependency of vertex 1 to 0.17, 0.12, 0.08, 0.059, 0.046 as $n$ moves from 3 to 7. Thus for large $n$
we expect PageRank to produce an almost vanishing dependency for vertex 1, thus approaching
the PWP result. PWP for arbitrary $\lambda > 0$ yields the matrix $T$ given by $T_{j+s,j} = \frac{\lambda^s}{e_+^{\lambda} s!}$ for
$1 \leq j \leq n$, $1 \leq s \leq n - j$, and $T_{ij} = 0$ otherwise. Notice that for $s \leq n - j - 1$ we have that

\[ \frac{T_{j+s+1,j}}{T_{j+s,j}} = \frac{\lambda}{s + 1}, \]

thus $T_{j+s+1,j} > T_{j+s,j}$ if $s < \lambda - 1$ and $T_{j+s+1,j} < T_{j+s,j}$ if $s > \lambda - 1$. Therefore $T_{j+s,j}$ achieves
its maximum, for fixed $j$, at $s = \lfloor \lambda \rfloor$ if $\lambda \geq 1$ and at $s = 1$ otherwise.
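
For $L_n$ the matrix $D$ is nilpotent, so the PWP series is a finite sum and can be evaluated exactly with rational arithmetic. The sketch below computes the unnormalized matrix $S = e_+^{\lambda} T = \sum_{k=1}^{n-1} \lambda^k D^k / k!$ for $\lambda = 1$; the function names are illustrative, and dividing by $e - 1$ recovers $T$.

```python
from fractions import Fraction
from math import factorial

def mat_mul(A, B):
    """Multiply two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][l] * B[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def line_graph(n):
    """Matrix of L_n: an edge of weight 1 from i to i+1, stored as D[i+1][i]."""
    D = [[Fraction(0)] * n for _ in range(n)]
    for i in range(n - 1):
        D[i + 1][i] = Fraction(1)
    return D

def unnormalized_pwp(D, lam):
    """Exact S = sum_{k=1}^{n-1} lam^k D^k / k!, valid because D^n = 0 for L_n."""
    n = len(D)
    S = [[Fraction(0)] * n for _ in range(n)]
    power = D
    for k in range(1, n):
        for i in range(n):
            for j in range(n):
                S[i][j] += Fraction(lam) ** k * power[i][j] / factorial(k)
        power = mat_mul(power, D)
    return S

def influences_and_dependencies(S):
    """Return (f, d): column sums and row sums of S."""
    n = len(S)
    f = [sum(S[i][j] for i in range(n)) for j in range(n)]
    d = [sum(S[i][j] for j in range(n)) for i in range(n)]
    return f, d

f3, d3 = influences_and_dependencies(unnormalized_pwp(line_graph(3), 1))
f4, d4 = influences_and_dependencies(unnormalized_pwp(line_graph(4), 1))
print(f3, d3, f4, d4)
```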

Example 5. Let $C_n$ be the $n$-cycle graph on $[n]$, i.e. the graph with an edge from $i$ to $i + 1$
for $i < n$ and an edge from $n$ to 1. In this case the three methods yield the same vector of
dependencies, but they do so for quite different reasons. MICMAC makes the influence and
dependence of each vertex equal to 1. For fixed $k$, vertex $i$ will only influence vertex $(i + k)$ mod
$n$. The PageRank vector of dependencies is also $(1, \ldots, 1)$, but in this case it means that each vertex
is equally dependent on every other vertex. With PWP the matrix $T$ of indirect influences is
given, for $j, s \in [n]$ and $j + s$ taken mod $n$, by

\[ T_{j+s,j} = \frac{1}{e_+^{\lambda}} \sum_{k=0}^{\infty} \frac{\lambda^{nk+s}}{(nk + s)!}, \qquad \text{so for } \lambda = 1 \text{ we have } \qquad T_{j+s,j} = \frac{1}{e - 1} \sum_{k=0}^{\infty} \frac{1}{(nk + s)!}, \]

and therefore we get that $T_{j+1,j} > T_{j+2,j} > \cdots > T_{j+n,j} = T_{jj}$, i.e. the total influence and
dependence of vertex $j$ is 1, and it influences all other vertices, but it has a higher influence
over the vertices closer to it.
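
For the cycle, a path of length $m$ starting at $j$ ends at $j + m$ mod $n$, so the PWP weights $\lambda^m/(e_+^{\lambda} m!)$ can simply be accumulated by residue class. A minimal sketch, assuming a cutoff `max_len` on the path length and the sample values $n = 5$, $\lambda = 1$:

```python
import math

def cycle_pwp_influences(n, lam, max_len=200):
    """Return [T_{j+1,j}, ..., T_{j+n,j}] for the n-cycle (independent of j)."""
    values = [0.0] * n
    term = 1.0
    for m in range(1, max_len + 1):
        term *= lam / m                  # lam^m / m!
        values[(m - 1) % n] += term      # a path of length m ends s = m mod n steps ahead
    scale = math.expm1(lam)
    return [v / scale for v in values]

influences = cycle_pwp_influences(5, 1.0)
print(influences, sum(influences))
```

The entries decrease with the distance $s$ and add up to 1, in line with the chain of inequalities above.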

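Example 6. According to properties 1 and 2 of Theorem 1, choosing an appropriate basis,
one can always reduce the computation of $T = T(D)$ to the case where $D$ is a matrix in Jordan
canonical form. Thus, we compute $T$ for $D$ an $n \times n$ matrix such that $D_{jj} = a \in \mathbb{C}$, $D_{j+1,j} = 1$,
and $D_{ij} = 0$ otherwise. Vertex $j$ exerts a non-vanishing indirect influence over the vertices
$j + s$ with $0 \leq s \leq n - j$. Notice that at vertex $j$ a path can either stay at $j$ or move to $j + 1$,
thus the MICMAC matrix is given by $|P_k(j + s, j), \omega| = \binom{k}{s} a^{k-s}$, and therefore the PWP matrix
of indirect influences is given, for $1 \leq s \leq n - j$, by

\[ T_{j+s,j} = \frac{1}{e_+^{\lambda}} \sum_{k=s}^{\infty} \binom{k}{s} a^{k-s} \frac{\lambda^k}{k!} = \frac{e^{a\lambda} \lambda^s}{e_+^{\lambda} s!}, \]

while on the diagonal the sum starts at $k = 1$ and gives $T_{jj} = \frac{e^{a\lambda} - 1}{e_+^{\lambda}}$.
Matrices in Jordan canonical form are not quite well-adapted for PageRank.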
Example 7. Consider the star graph with vertex set $\{0, 1, \ldots, n\}$ and edges from 0 to $i \in [n]$
and vice versa. It is not hard to see that the PWP matrix of indirect influences $T$ is given, for
$i, j \in [n]$, by

\[ T_{00} = \frac{e^{\lambda\sqrt{n}} + e^{-\lambda\sqrt{n}} - 2}{2 e_+^{\lambda}}, \qquad T_{0i} = T_{i0} = \frac{1}{e_+^{\lambda}} \sum_{k=0}^{\infty} \frac{n^k \lambda^{2k+1}}{(2k + 1)!}, \qquad T_{ij} = \frac{e^{\lambda\sqrt{n}} + e^{-\lambda\sqrt{n}} - 2}{2 n e_+^{\lambda}}. \]

Thus the vertex with greater influence and dependence is the vertex 0. PageRank yields a
similar result, making 0 the most dependent vertex.
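
The PWP entries for the star graph can be compared with closed forms in hyperbolic functions, since the even powers of $D$ contribute $\cosh(\lambda\sqrt{n}) - 1$ and the odd powers contribute $\sinh(\lambda\sqrt{n})/\sqrt{n}$. The sketch below is a numerical check under the assumptions $n = 4$, $\lambda = 1$ and a truncated series; the helper names are illustrative.

```python
import math

def mat_mul(A, B):
    """Multiply two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][l] * B[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def pwp(D, lam, terms=60):
    """Approximate T = (e^{lam D} - I) / (e^lam - 1) by a truncated series."""
    n = len(D)
    power = [[float(i == j) for j in range(n)] for i in range(n)]
    total = [[0.0] * n for _ in range(n)]
    coeff = 1.0
    for k in range(1, terms + 1):
        power = mat_mul(power, D)
        coeff *= lam / k
        for i in range(n):
            for j in range(n):
                total[i][j] += coeff * power[i][j]
    scale = math.expm1(lam)
    return [[x / scale for x in row] for row in total]

def star_graph(n):
    """Vertices 0..n with edges of weight 1 between 0 and each i in 1..n, both ways."""
    D = [[0.0] * (n + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        D[0][i] = 1.0
        D[i][0] = 1.0
    return D

n, lam = 4, 1.0
T = pwp(star_graph(n), lam)
root, scale = math.sqrt(n), math.expm1(lam)
closed_center = (math.cosh(lam * root) - 1) / scale        # T_00
closed_spoke = math.sinh(lam * root) / (root * scale)      # T_0i = T_i0
closed_leaves = (math.cosh(lam * root) - 1) / (n * scale)  # T_ij for i, j in [n]
print(T[0][0], closed_center, T[0][1], closed_spoke, T[1][2], closed_leaves)
```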



6 Final comments 

We introduced the PWP method for counting indirect influences. Applications of PWP to
real-world networks are currently underway. Our main goal in this note has been to propose an
alternative route for counting indirect influences. It seems convenient to have a pool of options,
as well as a comparative study of the various possibilities. We worked with a discrete scenario
where influences are transmitted linearly. Lifting those restrictions will lead to continuous
non-linear models. This more general setting will be considered elsewhere.